Overview
Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 298418 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 74.8 MiB |
| Average record size in memory | 263.0 B |
Variable types
| Categorical | 4 |
|---|---|
| Boolean | 1 |
| Numeric | 12 |
| DateTime | 3 |
rate_code is highly overall correlated with MTA_tax | High correlation |
Dropoff_latitude is highly overall correlated with Pickup_latitude | High correlation |
Trip_distance is highly overall correlated with Fare_amount and 1 other fields | High correlation |
Fare_amount is highly overall correlated with Total_amount and 1 other fields | High correlation |
MTA_tax is highly overall correlated with rate_code | High correlation |
Tip_amount is highly overall correlated with Payment_type | High correlation |
Total_amount is highly overall correlated with Fare_amount and 1 other fields | High correlation |
Payment_type is highly overall correlated with Tip_amount | High correlation |
Pickup_longitude is highly overall correlated with Dropoff_longitude | High correlation |
Pickup_latitude is highly overall correlated with Dropoff_latitude | High correlation |
Dropoff_longitude is highly overall correlated with Pickup_longitude | High correlation |
Store_and_fwd_flag is highly imbalanced (95.0%) | Imbalance |
MTA_tax is highly imbalanced (92.6%) | Imbalance |
Payment_type is highly imbalanced (51.6%) | Imbalance |
Trip_type is highly imbalanced (90.5%) | Imbalance |
Dropoff_latitude is highly skewed (γ1 = -48.70178852) | Skewed |
Fare_amount is highly skewed (γ1 = 68.48804575) | Skewed |
Extra is highly skewed (γ1 = 22.79680221) | Skewed |
Tolls_amount is highly skewed (γ1 = 313.2344187) | Skewed |
Total_amount is highly skewed (γ1 = 51.65401062) | Skewed |
Pickup_longitude is highly skewed (γ1 = -316.8293882) | Skewed |
Pickup_latitude is highly skewed (γ1 = -52.95918059) | Skewed |
Dropoff_longitude is highly skewed (γ1 = -288.9412035) | Skewed |
Trip_distance has 8235 (2.8%) zeros | Zeros |
Extra has 142027 (47.6%) zeros | Zeros |
Tip_amount has 213588 (71.6%) zeros | Zeros |
Tolls_amount has 291112 (97.6%) zeros | Zeros |
Reproduction
| Analysis started | 2025-11-25 13:23:51.994923 |
|---|---|
| Analysis finished | 2025-11-25 13:24:51.338744 |
| Duration | 59.34 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 236026 | |
| 1 | 62392 | 20.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 236026 | |
| 1 | 62392 | 20.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 236026 | |
| 1 | 62392 | 20.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 236026 | |
| 1 | 62392 | 20.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 236026 | |
| 1 | 62392 | 20.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 236026 | |
| 1 | 62392 | 20.9% |
Store_and_fwd_flag
Boolean
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 291.6 KiB |
| False | |
|---|---|
| True | 1657 |
| Value | Count | Frequency (%) |
| False | 296761 | |
| True | 1657 | 0.6% |
rate_code
Real number (ℝ)
High correlation
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1250863 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.67654351 |
|---|---|
| Coefficient of variation (CV) | 0.60132589 |
| Kurtosis | 28.148521 |
| Mean | 1.1250863 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.4521015 |
| Sum | 335746 |
| Variance | 0.45771112 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 287273 | |
| 5 | 8501 | 2.8% |
| 2 | 2138 | 0.7% |
| 3 | 402 | 0.1% |
| 4 | 69 | < 0.1% |
| 6 | 35 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 287273 | |
| 2 | 2138 | 0.7% |
| 3 | 402 | 0.1% |
| 4 | 69 | < 0.1% |
| 5 | 8501 | 2.8% |
| 6 | 35 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 35 | < 0.1% |
| 5 | 8501 | 2.8% |
| 4 | 69 | < 0.1% |
| 3 | 402 | 0.1% |
| 2 | 2138 | 0.7% |
| 1 | 287273 |
Dropoff_latitude
Real number (ℝ)
High correlation Skewed
| Distinct | 61717 |
|---|---|
| Distinct (%) | 20.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.768527 |
| Minimum | 25.685133 |
|---|---|
| Maximum | 41.628765 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 25.685133 |
|---|---|
| 5-th percentile | 40.678139 |
| Q1 | 40.734463 |
| median | 40.76577 |
| Q3 | 40.808352 |
| 95-th percentile | 40.857346 |
| Maximum | 41.628765 |
| Range | 15.943632 |
| Interquartile range (IQR) | 0.073889732 |
Descriptive statistics
| Standard deviation | 0.062869721 |
|---|---|
| Coefficient of variation (CV) | 0.0015421141 |
| Kurtosis | 11242.576 |
| Mean | 40.768527 |
| Median Absolute Deviation (MAD) | 0.038631439 |
| Skewness | -48.701789 |
| Sum | 12166062 |
| Variance | 0.0039526018 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.75817871 | 47 | < 0.1% |
| 40.75819016 | 39 | < 0.1% |
| 40.75818634 | 39 | < 0.1% |
| 40.77436066 | 39 | < 0.1% |
| 40.76839828 | 38 | < 0.1% |
| 40.7743187 | 38 | < 0.1% |
| 40.77425385 | 38 | < 0.1% |
| 40.75816345 | 38 | < 0.1% |
| 40.77428818 | 37 | < 0.1% |
| 40.75813675 | 37 | < 0.1% |
| Other values (61707) | 298028 |
| Value | Count | Frequency (%) |
| 25.68513298 | 1 | |
| 36.13668442 | 1 | |
| 37.37200546 | 1 | |
| 38.7928009 | 1 | |
| 38.79285049 | 1 | |
| 38.91895294 | 1 | |
| 38.92712784 | 1 | |
| 38.92715073 | 1 | |
| 40.31602097 | 1 | |
| 40.35122299 | 1 |
| Value | Count | Frequency (%) |
| 41.62876511 | 1 | |
| 41.34133911 | 1 | |
| 41.22953796 | 1 | |
| 41.1880722 | 1 | |
| 41.1633873 | 1 | |
| 41.14956665 | 1 | |
| 41.14128876 | 1 | |
| 41.13276291 | 1 | |
| 41.11446381 | 1 | |
| 41.11336136 | 1 |
Passenger_count
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5577311 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 101 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 5 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.277056 |
|---|---|
| Coefficient of variation (CV) | 0.81981798 |
| Kurtosis | 3.5373219 |
| Mean | 1.5577311 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.2405681 |
| Sum | 464855 |
| Variance | 1.6308719 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 236206 | |
| 5 | 26025 | 8.7% |
| 2 | 23048 | 7.7% |
| 3 | 6773 | 2.3% |
| 6 | 3498 | 1.2% |
| 4 | 2752 | 0.9% |
| 0 | 101 | < 0.1% |
| 7 | 9 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 101 | < 0.1% |
| 1 | 236206 | |
| 2 | 23048 | 7.7% |
| 3 | 6773 | 2.3% |
| 4 | 2752 | 0.9% |
| 5 | 26025 | 8.7% |
| 6 | 3498 | 1.2% |
| 7 | 9 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | < 0.1% |
| 8 | 4 | < 0.1% |
| 7 | 9 | < 0.1% |
| 6 | 3498 | 1.2% |
| 5 | 26025 | 8.7% |
| 4 | 2752 | 0.9% |
| 3 | 6773 | 2.3% |
| 2 | 23048 | 7.7% |
| 1 | 236206 | |
| 0 | 101 | < 0.1% |
Trip_distance
Real number (ℝ)
High correlation Zeros
| Distinct | 2408 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.976382 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 8235 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.4 |
| Q1 | 1.09 |
| median | 1.99 |
| Q3 | 3.84 |
| 95-th percentile | 8.6 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 2.75 |
Descriptive statistics
| Standard deviation | 2.9987373 |
|---|---|
| Coefficient of variation (CV) | 1.0075109 |
| Kurtosis | 17.016139 |
| Mean | 2.976382 |
| Median Absolute Deviation (MAD) | 1.11 |
| Skewness | 2.834292 |
| Sum | 888205.96 |
| Variance | 8.9924254 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8235 | 2.8% |
| 1 | 3071 | 1.0% |
| 0.9 | 3006 | 1.0% |
| 1.2 | 2952 | 1.0% |
| 1.1 | 2906 | 1.0% |
| 0.8 | 2821 | 0.9% |
| 1.3 | 2767 | 0.9% |
| 1.4 | 2744 | 0.9% |
| 0.7 | 2555 | 0.9% |
| 1.5 | 2435 | 0.8% |
| Other values (2398) | 264926 |
| Value | Count | Frequency (%) |
| 0 | 8235 | |
| 0.01 | 306 | 0.1% |
| 0.02 | 241 | 0.1% |
| 0.03 | 209 | 0.1% |
| 0.04 | 154 | 0.1% |
| 0.05 | 122 | < 0.1% |
| 0.06 | 111 | < 0.1% |
| 0.07 | 100 | < 0.1% |
| 0.08 | 83 | < 0.1% |
| 0.09 | 90 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 1 | |
| 62.18 | 1 | |
| 60.3 | 1 | |
| 58.3 | 1 | |
| 56 | 1 | |
| 54.75 | 1 | |
| 53.2 | 1 | |
| 43.89 | 1 | |
| 43.53 | 1 | |
| 42.03 | 1 |
Fare_amount
Real number (ℝ)
High correlation Skewed
| Distinct | 344 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.398314 |
| Minimum | 0 |
|---|---|
| Maximum | 2794.5 |
| Zeros | 2108 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 6.5 |
| median | 9.5 |
| Q3 | 15.5 |
| 95-th percentile | 30 |
| Maximum | 2794.5 |
| Range | 2794.5 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 11.273051 |
|---|---|
| Coefficient of variation (CV) | 0.90924065 |
| Kurtosis | 15163.647 |
| Mean | 12.398314 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 68.488046 |
| Sum | 3699880 |
| Variance | 127.08168 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.5 | 14211 | 4.8% |
| 6 | 14200 | 4.8% |
| 5.5 | 13611 | 4.6% |
| 7 | 13600 | 4.6% |
| 5 | 12991 | 4.4% |
| 7.5 | 12514 | 4.2% |
| 8 | 11975 | 4.0% |
| 8.5 | 10781 | 3.6% |
| 4.5 | 10366 | 3.5% |
| 9 | 9947 | 3.3% |
| Other values (334) | 174222 |
| Value | Count | Frequency (%) |
| 0 | 2108 | |
| 0.01 | 36 | < 0.1% |
| 0.02 | 4 | < 0.1% |
| 0.03 | 5 | < 0.1% |
| 0.05 | 4 | < 0.1% |
| 0.07 | 2 | < 0.1% |
| 0.08 | 4 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 13 | < 0.1% |
| 0.11 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2794.5 | 1 | |
| 1912.5 | 1 | |
| 503 | 1 | |
| 444 | 1 | |
| 350 | 1 | |
| 250 | 2 | |
| 200 | 1 | |
| 184.5 | 1 | |
| 180 | 2 | |
| 172 | 1 |
Extra
Real number (ℝ)
Skewed Zeros
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.35395043 |
| Minimum | 0 |
|---|---|
| Maximum | 54.67 |
| Zeros | 142027 |
| Zeros (%) | 47.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.5 |
| Q3 | 0.5 |
| 95-th percentile | 1 |
| Maximum | 54.67 |
| Range | 54.67 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.44915874 |
|---|---|
| Coefficient of variation (CV) | 1.2689877 |
| Kurtosis | 2033.202 |
| Mean | 0.35395043 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 22.796802 |
| Sum | 105625.18 |
| Variance | 0.20174358 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 142027 | |
| 0.5 | 103004 | |
| 1 | 53315 | 17.9% |
| 8 | 12 | < 0.1% |
| 5 | 5 | < 0.1% |
| 7.5 | 5 | < 0.1% |
| 10 | 5 | < 0.1% |
| 12 | 4 | < 0.1% |
| 2 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| Other values (25) | 34 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 142027 | |
| 0.01 | 2 | < 0.1% |
| 0.02 | 3 | < 0.1% |
| 0.5 | 103004 | |
| 0.51 | 1 | < 0.1% |
| 0.52 | 1 | < 0.1% |
| 0.6 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| 1 | 53315 | 17.9% |
| 1.5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 54.67 | 1 | |
| 45 | 2 | |
| 42 | 1 | |
| 34.33 | 1 | |
| 30.5 | 1 | |
| 30 | 2 | |
| 25 | 1 | |
| 23 | 1 | |
| 22.22 | 1 | |
| 17 | 1 |
MTA_tax
Categorical
High correlation Imbalance
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.1 MiB |
| 0.5 | |
|---|---|
| 0.0 | 7558 |
| 0.4 | 4 |
| 0.25 | 3 |
| 0.6 | 2 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0000101 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.5 |
|---|---|
| 2nd row | 0.5 |
| 3rd row | 0.5 |
| 4th row | 0.5 |
| 5th row | 0.5 |
Common Values
| Value | Count | Frequency (%) |
| 0.5 | 290851 | |
| 0.0 | 7558 | 2.5% |
| 0.4 | 4 | < 0.1% |
| 0.25 | 3 | < 0.1% |
| 0.6 | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.5 | 290851 | |
| 0.0 | 7558 | 2.5% |
| 0.4 | 4 | < 0.1% |
| 0.25 | 3 | < 0.1% |
| 0.6 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 305976 | |
| . | 298418 | |
| 5 | 290854 | |
| 4 | 4 | < 0.1% |
| 2 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 895257 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 305976 | |
| . | 298418 | |
| 5 | 290854 | |
| 4 | 4 | < 0.1% |
| 2 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 895257 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 305976 | |
| . | 298418 | |
| 5 | 290854 | |
| 4 | 4 | < 0.1% |
| 2 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 895257 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 305976 | |
| . | 298418 | |
| 5 | 290854 | |
| 4 | 4 | < 0.1% |
| 2 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
Tip_amount
Real number (ℝ)
High correlation Zeros
| Distinct | 871 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.88196513 |
| Minimum | 0 |
|---|---|
| Maximum | 210.08 |
| Zeros | 213588 |
| Zeros (%) | 71.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 4.7 |
| Maximum | 210.08 |
| Range | 210.08 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.1264953 |
|---|---|
| Coefficient of variation (CV) | 2.4110878 |
| Kurtosis | 989.78495 |
| Mean | 0.88196513 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.872708 |
| Sum | 263194.27 |
| Variance | 4.5219824 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 213588 | |
| 1 | 9388 | 3.1% |
| 2 | 8508 | 2.9% |
| 1.5 | 4565 | 1.5% |
| 3 | 3975 | 1.3% |
| 2.5 | 2397 | 0.8% |
| 4 | 2048 | 0.7% |
| 5 | 1836 | 0.6% |
| 1.8 | 1555 | 0.5% |
| 1.4 | 1465 | 0.5% |
| Other values (861) | 49093 | 16.5% |
| Value | Count | Frequency (%) |
| 0 | 213588 | |
| 0.01 | 130 | < 0.1% |
| 0.02 | 39 | < 0.1% |
| 0.03 | 17 | < 0.1% |
| 0.04 | 4 | < 0.1% |
| 0.05 | 30 | < 0.1% |
| 0.06 | 10 | < 0.1% |
| 0.07 | 3 | < 0.1% |
| 0.08 | 24 | < 0.1% |
| 0.09 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 210.08 | 1 | |
| 200 | 1 | |
| 175 | 1 | |
| 150 | 1 | |
| 113.77 | 1 | |
| 110 | 1 | |
| 100 | 1 | |
| 99.45 | 1 | |
| 96 | 1 | |
| 88.25 | 1 |
Tolls_amount
Real number (ℝ)
Skewed Zeros
| Distinct | 86 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.14094438 |
| Minimum | 0 |
|---|---|
| Maximum | 950 |
| Zeros | 291112 |
| Zeros (%) | 97.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 950 |
| Range | 950 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.392929 |
|---|---|
| Coefficient of variation (CV) | 16.977825 |
| Kurtosis | 115511.92 |
| Mean | 0.14094438 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 313.23442 |
| Sum | 42060.34 |
| Variance | 5.7261091 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 291112 | |
| 5.33 | 6482 | 2.2% |
| 2.44 | 265 | 0.1% |
| 7.5 | 106 | < 0.1% |
| 10.66 | 84 | < 0.1% |
| 8.25 | 67 | < 0.1% |
| 9 | 51 | < 0.1% |
| 11 | 45 | < 0.1% |
| 10.25 | 34 | < 0.1% |
| 2 | 24 | < 0.1% |
| Other values (76) | 148 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 291112 | |
| 0.01 | 2 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.5 | 16 | < 0.1% |
| 1 | 4 | < 0.1% |
| 1.2 | 1 | < 0.1% |
| 1.25 | 1 | < 0.1% |
| 1.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 950 | 1 | |
| 750 | 1 | |
| 73 | 1 | |
| 46.5 | 1 | |
| 45 | 1 | |
| 33.16 | 1 | |
| 33 | 1 | |
| 28.25 | 1 | |
| 27.33 | 1 | |
| 24 | 1 |
Total_amount
Real number (ℝ)
High correlation Skewed
| Distinct | 2392 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.262719 |
| Minimum | 0 |
|---|---|
| Maximum | 2796 |
| Zeros | 1980 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 7.5 |
| median | 11 |
| Q3 | 17.5 |
| 95-th percentile | 34.88 |
| Maximum | 2796 |
| Range | 2796 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 12.612077 |
|---|---|
| Coefficient of variation (CV) | 0.88426881 |
| Kurtosis | 9827.2323 |
| Mean | 14.262719 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 51.654011 |
| Sum | 4256251.9 |
| Variance | 159.06449 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 12616 | 4.2% |
| 8 | 12137 | 4.1% |
| 6.5 | 11769 | 3.9% |
| 7.5 | 11446 | 3.8% |
| 6 | 10916 | 3.7% |
| 9 | 10171 | 3.4% |
| 8.5 | 10054 | 3.4% |
| 10 | 9534 | 3.2% |
| 9.5 | 9108 | 3.1% |
| 5.5 | 9046 | 3.0% |
| Other values (2382) | 191621 |
| Value | Count | Frequency (%) |
| 0 | 1980 | |
| 0.01 | 31 | < 0.1% |
| 0.02 | 3 | < 0.1% |
| 0.03 | 4 | < 0.1% |
| 0.05 | 4 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.08 | 3 | < 0.1% |
| 0.09 | 2 | < 0.1% |
| 0.1 | 10 | < 0.1% |
| 0.12 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 2796 | 1 | |
| 1914 | 1 | |
| 961 | 1 | |
| 770.5 | 1 | |
| 504.5 | 1 | |
| 445.5 | 1 | |
| 350 | 1 | |
| 348.5 | 1 | |
| 250.5 | 2 | |
| 228.08 | 1 |
Payment_type
Categorical
High correlation Imbalance
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 1110 |
| 4 | 765 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 199290 | |
| 1 | 97253 | |
| 3 | 1110 | 0.4% |
| 4 | 765 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 199290 | |
| 1 | 97253 | |
| 3 | 1110 | 0.4% |
| 4 | 765 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 199290 | |
| 1 | 97253 | |
| 3 | 1110 | 0.4% |
| 4 | 765 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 199290 | |
| 1 | 97253 | |
| 3 | 1110 | 0.4% |
| 4 | 765 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 199290 | |
| 1 | 97253 | |
| 3 | 1110 | 0.4% |
| 4 | 765 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 199290 | |
| 1 | 97253 | |
| 3 | 1110 | 0.4% |
| 4 | 765 | 0.3% |
Trip_type
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| 2 | |
|---|---|
| 1 | 3635 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 294783 | |
| 1 | 3635 | 1.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 294783 | |
| 1 | 3635 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 294783 | |
| 1 | 3635 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 294783 | |
| 1 | 3635 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 294783 | |
| 1 | 3635 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 298418 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 294783 | |
| 1 | 3635 | 1.2% |
Pickup_longitude
Real number (ℝ)
High correlation Skewed
| Distinct | 23898 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.925267 |
| Minimum | -121.92611 |
|---|---|
| Maximum | -73.020638 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 298418 |
| Negative (%) | 100.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -121.92611 |
|---|---|
| 5-th percentile | -73.978615 |
| Q1 | -73.953369 |
| median | -73.938515 |
| Q3 | -73.902977 |
| 95-th percentile | -73.844124 |
| Maximum | -73.020638 |
| Range | 48.905472 |
| Interquartile range (IQR) | 0.05039215 |
Descriptive statistics
| Standard deviation | 0.12421378 |
|---|---|
| Coefficient of variation (CV) | -0.0016802615 |
| Kurtosis | 115530.23 |
| Mean | -73.925267 |
| Median Absolute Deviation (MAD) | 0.02030945 |
| Skewness | -316.82939 |
| Sum | -22060630 |
| Variance | 0.015429063 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.84427643 | 478 | 0.2% |
| -73.8442688 | 471 | 0.2% |
| -73.84425354 | 442 | 0.1% |
| -73.84429169 | 408 | 0.1% |
| -73.84423828 | 403 | 0.1% |
| -73.84429932 | 379 | 0.1% |
| -73.84428406 | 345 | 0.1% |
| -73.84426117 | 319 | 0.1% |
| -73.84423065 | 305 | 0.1% |
| -73.84424591 | 292 | 0.1% |
| Other values (23888) | 294576 |
| Value | Count | Frequency (%) |
| -121.9261093 | 1 | |
| -115.1791 | 1 | |
| -80.31391144 | 1 | |
| -77.06020355 | 1 | |
| -77.0594635 | 1 | |
| -76.97978973 | 1 | |
| -76.97966766 | 1 | |
| -76.958992 | 1 | |
| -75.59073639 | 1 | |
| -74.4307251 | 1 |
| Value | Count | Frequency (%) |
| -73.02063751 | 1 | |
| -73.19286346 | 1 | |
| -73.24085999 | 1 | |
| -73.25282288 | 1 | |
| -73.32118988 | 1 | |
| -73.42676544 | 1 | |
| -73.52334595 | 1 | |
| -73.52476501 | 1 | |
| -73.5295105 | 1 | |
| -73.53754425 | 1 |
Pickup_latitude
Real number (ℝ)
High correlation Skewed
| Distinct | 51729 |
|---|---|
| Distinct (%) | 17.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.774183 |
| Minimum | 25.684929 |
|---|---|
| Maximum | 41.628765 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 25.684929 |
|---|---|
| 5-th percentile | 40.687202 |
| Q1 | 40.734811 |
| median | 40.775486 |
| Q3 | 40.810806 |
| 95-th percentile | 40.855912 |
| Maximum | 41.628765 |
| Range | 15.943836 |
| Interquartile range (IQR) | 0.075995445 |
Descriptive statistics
| Standard deviation | 0.061193477 |
|---|---|
| Coefficient of variation (CV) | 0.0015007898 |
| Kurtosis | 12545.94 |
| Mean | 40.774183 |
| Median Absolute Deviation (MAD) | 0.036024094 |
| Skewness | -52.959181 |
| Sum | 12167750 |
| Variance | 0.0037446416 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.72133636 | 348 | 0.1% |
| 40.72132874 | 342 | 0.1% |
| 40.72132111 | 300 | 0.1% |
| 40.72135162 | 285 | 0.1% |
| 40.72134018 | 270 | 0.1% |
| 40.72135544 | 254 | 0.1% |
| 40.72136688 | 239 | 0.1% |
| 40.72133255 | 236 | 0.1% |
| 40.72131348 | 228 | 0.1% |
| 40.72134399 | 213 | 0.1% |
| Other values (51719) | 295703 |
| Value | Count | Frequency (%) |
| 25.68492889 | 1 | |
| 36.13709641 | 1 | |
| 37.37200165 | 1 | |
| 38.7929039 | 1 | |
| 38.79291534 | 1 | |
| 38.9189415 | 1 | |
| 38.92731094 | 2 | |
| 40.46934891 | 1 | |
| 40.52040863 | 1 | |
| 40.57337189 | 1 |
| Value | Count | Frequency (%) |
| 41.62876511 | 1 | |
| 41.13275909 | 1 | |
| 41.03786087 | 1 | |
| 40.99474335 | 1 | |
| 40.98063278 | 1 | |
| 40.97935486 | 1 | |
| 40.9702301 | 1 | |
| 40.96680069 | 1 | |
| 40.96501541 | 1 | |
| 40.9609108 | 1 |
Dropoff_longitude
Real number (ℝ)
High correlation Skewed
| Distinct | 31326 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.926735 |
| Minimum | -121.9259 |
|---|---|
| Maximum | -69.347466 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 298418 |
| Negative (%) | 100.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -121.9259 |
|---|---|
| 5-th percentile | -73.992905 |
| Q1 | -73.960419 |
| median | -73.938316 |
| Q3 | -73.89711 |
| 95-th percentile | -73.829468 |
| Maximum | -69.347466 |
| Range | 52.578438 |
| Interquartile range (IQR) | 0.06330871 |
Descriptive statistics
| Standard deviation | 0.12805712 |
|---|---|
| Coefficient of variation (CV) | -0.0017322167 |
| Kurtosis | 102264.39 |
| Mean | -73.926735 |
| Median Absolute Deviation (MAD) | 0.02974701 |
| Skewness | -288.9412 |
| Sum | -22061068 |
| Variance | 0.016398626 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.9391098 | 66 | < 0.1% |
| -73.93917847 | 66 | < 0.1% |
| -73.93914032 | 61 | < 0.1% |
| -73.93741608 | 60 | < 0.1% |
| -73.93907166 | 60 | < 0.1% |
| -73.93756866 | 59 | < 0.1% |
| -73.93740845 | 58 | < 0.1% |
| -73.93916321 | 58 | < 0.1% |
| -73.93913269 | 56 | < 0.1% |
| -73.94902039 | 55 | < 0.1% |
| Other values (31316) | 297819 |
| Value | Count | Frequency (%) |
| -121.9259033 | 1 | |
| -115.1793365 | 1 | |
| -80.31430054 | 1 | |
| -77.0602951 | 1 | |
| -77.06020355 | 1 | |
| -76.97981262 | 1 | |
| -76.97956085 | 1 | |
| -76.95897675 | 1 | |
| -75.59074402 | 1 | |
| -74.45738983 | 1 |
| Value | Count | Frequency (%) |
| -69.34746552 | 1 | |
| -72.93803406 | 1 | |
| -73.01972961 | 1 | |
| -73.1678772 | 1 | |
| -73.19140625 | 1 | |
| -73.24086761 | 1 | |
| -73.25280762 | 1 | |
| -73.26731873 | 1 | |
| -73.29314423 | 1 | |
| -73.29319 | 1 |
| Distinct | 4534 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| Minimum | 2025-11-25 00:00:01 |
|---|---|
| Maximum | 2025-11-25 23:58:39 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
pickup_datetime
Date
| Distinct | 290160 |
|---|---|
| Distinct (%) | 97.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| Minimum | 2013-01-09 00:03:30 |
|---|---|
| Maximum | 2013-12-31 23:59:57 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
dropoff_datetime
Date
| Distinct | 290640 |
|---|---|
| Distinct (%) | 97.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| Minimum | 2013-01-09 00:13:56 |
|---|---|
| Maximum | 2014-01-01 21:36:54 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Interactions
Correlations
| vendor_id | rate_code | Dropoff_latitude | Passenger_count | Trip_distance | Fare_amount | Extra | MTA_tax | Tip_amount | Tolls_amount | Total_amount | Payment_type | Trip_type | Pickup_longitude | Pickup_latitude | Dropoff_longitude | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| vendor_id | 1.000 | -0.086 | 0.032 | 0.103 | 0.007 | 0.024 | 0.012 | 0.150 | -0.004 | 0.002 | 0.022 | -0.052 | -0.057 | 0.023 | 0.043 | 0.019 |
| rate_code | -0.086 | 1.000 | 0.073 | -0.024 | 0.054 | 0.059 | -0.128 | -0.785 | -0.011 | 0.016 | 0.044 | 0.062 | 0.007 | 0.036 | 0.084 | 0.039 |
| Dropoff_latitude | 0.032 | 0.073 | 1.000 | -0.043 | -0.183 | -0.153 | -0.071 | -0.062 | -0.140 | -0.013 | -0.166 | 0.124 | 0.011 | 0.205 | 0.860 | 0.207 |
| Passenger_count | 0.103 | -0.024 | -0.043 | 1.000 | 0.005 | 0.009 | 0.024 | 0.020 | 0.001 | -0.001 | 0.009 | -0.006 | -0.023 | 0.005 | -0.048 | 0.003 |
| Trip_distance | 0.007 | 0.054 | -0.183 | 0.005 | 1.000 | 0.727 | -0.034 | -0.025 | 0.403 | 0.134 | 0.742 | -0.203 | -0.004 | -0.000 | -0.021 | 0.013 |
| Fare_amount | 0.024 | 0.059 | -0.153 | 0.009 | 0.727 | 1.000 | -0.035 | -0.001 | 0.326 | 0.101 | 0.967 | -0.169 | -0.005 | -0.008 | -0.032 | -0.008 |
| Extra | 0.012 | -0.128 | -0.071 | 0.024 | -0.034 | -0.035 | 1.000 | 0.093 | -0.003 | -0.008 | 0.003 | -0.015 | -0.007 | -0.000 | -0.093 | 0.003 |
| MTA_tax | 0.150 | -0.785 | -0.062 | 0.020 | -0.025 | -0.001 | 0.093 | 1.000 | 0.017 | -0.014 | 0.009 | -0.070 | -0.011 | -0.020 | -0.066 | -0.016 |
| Tip_amount | -0.004 | -0.011 | -0.140 | 0.001 | 0.403 | 0.326 | -0.003 | 0.017 | 1.000 | 0.082 | 0.475 | -0.579 | -0.003 | -0.047 | -0.057 | -0.050 |
| Tolls_amount | 0.002 | 0.016 | -0.013 | -0.001 | 0.134 | 0.101 | -0.008 | -0.014 | 0.082 | 1.000 | 0.293 | -0.031 | -0.000 | -0.003 | 0.010 | 0.007 |
| Total_amount | 0.022 | 0.044 | -0.166 | 0.009 | 0.742 | 0.967 | 0.003 | 0.009 | 0.475 | 0.293 | 1.000 | -0.255 | -0.006 | -0.016 | -0.040 | -0.014 |
| Payment_type | -0.052 | 0.062 | 0.124 | -0.006 | -0.203 | -0.169 | -0.015 | -0.070 | -0.579 | -0.031 | -0.255 | 1.000 | 0.006 | 0.071 | 0.071 | 0.087 |
| Trip_type | -0.057 | 0.007 | 0.011 | -0.023 | -0.004 | -0.005 | -0.007 | -0.011 | -0.003 | -0.000 | -0.006 | 0.006 | 1.000 | -0.007 | 0.009 | -0.005 |
| Pickup_longitude | 0.023 | 0.036 | 0.205 | 0.005 | -0.000 | -0.008 | -0.000 | -0.020 | -0.047 | -0.003 | -0.016 | 0.071 | -0.007 | 1.000 | 0.206 | 0.956 |
| Pickup_latitude | 0.043 | 0.084 | 0.860 | -0.048 | -0.021 | -0.032 | -0.093 | -0.066 | -0.057 | 0.010 | -0.040 | 0.071 | 0.009 | 0.206 | 1.000 | 0.193 |
| Dropoff_longitude | 0.019 | 0.039 | 0.207 | 0.003 | 0.013 | -0.008 | 0.003 | -0.016 | -0.050 | 0.007 | -0.014 | 0.087 | -0.005 | 0.956 | 0.193 | 1.000 |
| Dropoff_latitude | Dropoff_longitude | Extra | Fare_amount | MTA_tax | Passenger_count | Payment_type | Pickup_latitude | Pickup_longitude | Store_and_fwd_flag | Tip_amount | Tolls_amount | Total_amount | Trip_distance | Trip_type | rate_code | vendor_id | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dropoff_latitude | 1.000 | 0.096 | -0.126 | -0.178 | 0.387 | -0.070 | 0.000 | 0.822 | 0.039 | 0.000 | -0.173 | -0.025 | -0.201 | -0.174 | 0.000 | 0.059 | 0.000 |
| Dropoff_longitude | 0.096 | 1.000 | 0.005 | -0.147 | 0.354 | 0.013 | 0.000 | 0.009 | 0.693 | 0.000 | -0.298 | 0.058 | -0.178 | -0.132 | 0.000 | 0.115 | 0.000 |
| Extra | -0.126 | 0.005 | 1.000 | -0.031 | 0.022 | 0.028 | 0.036 | -0.158 | -0.021 | 0.000 | 0.027 | -0.040 | 0.041 | -0.015 | 0.000 | -0.187 | 0.025 |
| Fare_amount | -0.178 | -0.147 | -0.031 | 1.000 | 0.000 | 0.022 | 0.000 | -0.049 | -0.055 | 0.000 | 0.298 | 0.216 | 0.983 | 0.912 | 0.000 | 0.060 | 0.000 |
| MTA_tax | 0.387 | 0.354 | 0.022 | 0.000 | 1.000 | 0.066 | 0.059 | 0.387 | 0.005 | 0.013 | 0.018 | 0.007 | 0.005 | 0.029 | 0.010 | 0.405 | 0.150 |
| Passenger_count | -0.070 | 0.013 | 0.028 | 0.022 | 0.066 | 1.000 | 0.020 | -0.080 | 0.024 | 0.044 | -0.001 | -0.000 | 0.022 | 0.015 | 0.026 | 0.009 | 0.225 |
| Payment_type | 0.000 | 0.000 | 0.036 | 0.000 | 0.059 | 0.020 | 1.000 | 0.000 | 0.000 | 0.024 | 0.016 | 0.000 | 0.000 | 0.048 | 0.009 | 0.061 | 0.155 |
| Pickup_latitude | 0.822 | 0.009 | -0.158 | -0.049 | 0.387 | -0.080 | 0.000 | 1.000 | 0.048 | 0.000 | -0.103 | 0.033 | -0.073 | -0.044 | 0.000 | 0.105 | 0.000 |
| Pickup_longitude | 0.039 | 0.693 | -0.021 | -0.055 | 0.005 | 0.024 | 0.000 | 0.048 | 1.000 | 0.000 | -0.267 | -0.027 | -0.091 | -0.045 | 0.000 | 0.086 | 0.000 |
| Store_and_fwd_flag | 0.000 | 0.000 | 0.000 | 0.000 | 0.013 | 0.044 | 0.024 | 0.000 | 0.000 | 1.000 | 0.003 | 0.000 | 0.000 | 0.012 | 0.008 | 0.009 | 0.145 |
| Tip_amount | -0.173 | -0.298 | 0.027 | 0.298 | 0.018 | -0.001 | 0.016 | -0.103 | -0.267 | 0.003 | 1.000 | 0.128 | 0.418 | 0.294 | 0.000 | -0.059 | 0.006 |
| Tolls_amount | -0.025 | 0.058 | -0.040 | 0.216 | 0.007 | -0.000 | 0.000 | 0.033 | -0.027 | 0.000 | 0.128 | 1.000 | 0.239 | 0.223 | 0.000 | 0.095 | 0.000 |
| Total_amount | -0.201 | -0.178 | 0.041 | 0.983 | 0.005 | 0.022 | 0.000 | -0.073 | -0.091 | 0.000 | 0.418 | 0.239 | 1.000 | 0.900 | 0.000 | 0.041 | 0.000 |
| Trip_distance | -0.174 | -0.132 | -0.015 | 0.912 | 0.029 | 0.015 | 0.048 | -0.044 | -0.045 | 0.012 | 0.294 | 0.223 | 0.900 | 1.000 | 0.011 | -0.009 | 0.004 |
| Trip_type | 0.000 | 0.000 | 0.000 | 0.000 | 0.010 | 0.026 | 0.009 | 0.000 | 0.000 | 0.008 | 0.000 | 0.000 | 0.000 | 0.011 | 1.000 | 0.008 | 0.057 |
| rate_code | 0.059 | 0.115 | -0.187 | 0.060 | 0.405 | 0.009 | 0.061 | 0.105 | 0.086 | 0.009 | -0.059 | 0.095 | 0.041 | -0.009 | 0.008 | 1.000 | 0.090 |
| vendor_id | 0.000 | 0.000 | 0.025 | 0.000 | 0.150 | 0.225 | 0.155 | 0.000 | 0.000 | 0.145 | 0.006 | 0.000 | 0.000 | 0.004 | 0.057 | 0.090 | 1.000 |
Missing values
Sample
| vendor_id | Store_and_fwd_flag | rate_code | Dropoff_latitude | Passenger_count | Trip_distance | Fare_amount | Extra | MTA_tax | Tip_amount | Tolls_amount | Total_amount | Payment_type | Trip_type | Pickup_longitude | Pickup_latitude | Dropoff_longitude | Difference_between_p_d_time | pickup_datetime | dropoff_datetime | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | N | 1 | 40.759087 | 1 | 1.80 | 9.0 | 0.5 | 0.5 | 0.0 | 0.0 | 10.0 | 2 | 2 | -73.900368 | 40.745861 | -73.925934 | 00:49:52 | 2013-03-11 01:06:00 | 2013-03-11 01:55:52 |
| 1 | 2 | N | 1 | 40.701069 | 1 | 6.36 | 22.0 | 0.5 | 0.5 | 0.0 | 0.0 | 23.0 | 2 | 2 | -73.852242 | 40.715996 | -73.922470 | 23:51:00 | 2013-11-30 00:04:00 | 2013-12-01 23:55:00 |
| 2 | 1 | N | 1 | 40.756233 | 1 | 4.60 | 16.0 | 0.5 | 0.5 | 3.4 | 0.0 | 20.4 | 1 | 2 | -73.954422 | 40.730217 | -73.967667 | 23:42:00 | 2013-11-30 00:05:00 | 2013-12-01 23:47:00 |
| 3 | 2 | N | 1 | 40.680714 | 1 | 2.69 | 10.0 | 1.0 | 0.5 | 0.0 | 0.0 | 11.5 | 2 | 2 | -73.830124 | 40.713730 | -73.811317 | 00:51:27 | 2013-11-12 01:03:11 | 2013-11-13 01:54:38 |
| 4 | 2 | N | 1 | 40.716694 | 1 | 7.21 | 31.0 | 0.5 | 0.5 | 4.0 | 0.0 | 36.0 | 1 | 2 | -73.924507 | 40.761822 | -73.996880 | 23:33:00 | 2013-10-12 00:09:00 | 2013-10-13 23:42:00 |
| 5 | 2 | N | 1 | 40.745071 | 1 | 1.76 | 8.0 | 0.5 | 0.5 | 0.0 | 0.0 | 9.0 | 2 | 2 | -73.915009 | 40.763996 | -73.919434 | 00:48:51 | 2013-11-30 01:09:46 | 2013-12-01 01:58:37 |
| 6 | 2 | N | 1 | 40.761436 | 1 | 6.80 | 24.0 | 0.5 | 0.5 | 0.0 | 0.0 | 25.0 | 2 | 2 | -73.937820 | 40.818451 | -73.984085 | 23:53:00 | 2013-10-31 00:01:00 | 2013-11-01 23:54:00 |
| 7 | 2 | N | 1 | 40.759182 | 5 | 2.86 | 11.0 | 0.5 | 0.5 | 0.0 | 0.0 | 12.0 | 1 | 2 | -73.919167 | 40.758801 | -73.875610 | 23:47:00 | 2013-11-12 00:00:00 | 2013-11-13 23:47:00 |
| 8 | 2 | N | 1 | 40.731438 | 1 | 4.06 | 26.0 | 0.5 | 0.5 | 5.4 | 0.0 | 32.4 | 1 | 2 | -73.951179 | 40.714035 | -74.006516 | 23:38:14 | 2013-10-12 00:04:37 | 2013-10-13 23:42:51 |
| 9 | 1 | N | 1 | 40.669708 | 1 | 3.70 | 15.0 | 0.5 | 0.5 | 0.0 | 0.0 | 16.0 | 1 | 2 | -73.948799 | 40.714195 | -73.931053 | 23:32:00 | 2013-11-30 00:14:00 | 2013-12-01 23:46:00 |
| vendor_id | Store_and_fwd_flag | rate_code | Dropoff_latitude | Passenger_count | Trip_distance | Fare_amount | Extra | MTA_tax | Tip_amount | Tolls_amount | Total_amount | Payment_type | Trip_type | Pickup_longitude | Pickup_latitude | Dropoff_longitude | Difference_between_p_d_time | pickup_datetime | dropoff_datetime | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 298408 | 2 | N | 1 | 40.748978 | 1 | 1.43 | 11.0 | 0.0 | 0.5 | 0.0 | 0.00 | 11.50 | 2 | 2 | -73.871170 | 40.733967 | -73.879402 | 00:16:19 | 2013-11-28 11:10:27 | 2013-11-28 11:26:46 |
| 298409 | 2 | N | 1 | 40.779667 | 1 | 4.68 | 16.0 | 0.0 | 0.5 | 3.2 | 0.00 | 19.70 | 1 | 2 | -73.959473 | 40.808659 | -73.961311 | 00:11:52 | 2013-11-18 22:45:48 | 2013-11-18 22:57:40 |
| 298410 | 1 | N | 1 | 40.783115 | 1 | 6.10 | 18.0 | 1.0 | 0.5 | 0.0 | 0.00 | 19.50 | 2 | 2 | -73.942535 | 40.841541 | -73.944221 | 00:10:36 | 2013-10-19 10:46:09 | 2013-10-19 10:56:45 |
| 298411 | 2 | N | 1 | 40.757404 | 3 | 5.08 | 17.0 | 0.0 | 0.5 | 0.0 | 5.33 | 22.83 | 1 | 2 | -73.938850 | 40.804958 | -73.902962 | 00:15:45 | 2013-12-26 09:07:31 | 2013-12-26 09:23:16 |
| 298412 | 2 | N | 1 | 40.824100 | 1 | 2.76 | 12.5 | 0.0 | 0.5 | 0.0 | 0.00 | 13.00 | 2 | 2 | -73.945763 | 40.807381 | -73.908875 | 00:14:45 | 2013-03-12 12:13:44 | 2013-03-12 12:28:29 |
| 298413 | 2 | N | 1 | 40.676388 | 1 | 1.77 | 8.0 | 0.5 | 0.5 | 0.0 | 0.00 | 9.00 | 2 | 2 | -73.993294 | 40.687912 | -73.967033 | 00:07:49 | 2013-04-12 00:39:55 | 2013-04-12 00:47:44 |
| 298414 | 1 | N | 1 | 40.843239 | 1 | 2.20 | 10.0 | 0.5 | 0.5 | 0.0 | 0.00 | 11.00 | 2 | 2 | -73.938286 | 40.846970 | -73.905930 | 00:10:11 | 2013-12-22 00:53:02 | 2013-12-22 01:03:13 |
| 298415 | 2 | N | 1 | 40.687347 | 1 | 1.81 | 10.0 | 0.0 | 0.5 | 2.0 | 0.00 | 12.50 | 1 | 2 | -73.986870 | 40.702442 | -73.979698 | 00:12:38 | 2013-12-21 13:13:14 | 2013-12-21 13:25:52 |
| 298416 | 2 | N | 1 | 40.727402 | 5 | 0.81 | 4.5 | 0.5 | 0.5 | 1.1 | 0.00 | 6.60 | 1 | 2 | -73.957687 | 40.717800 | -73.957184 | 00:03:13 | 2013-08-28 20:34:45 | 2013-08-28 20:37:58 |
| 298417 | 1 | N | 1 | 40.765503 | 1 | 1.50 | 6.0 | 0.0 | 0.5 | 0.0 | 0.00 | 6.50 | 2 | 2 | -73.917679 | 40.770012 | -73.890411 | 00:02:25 | 2013-03-10 10:30:58 | 2013-03-10 10:33:23 |